Ingesting the Auslan Corpus into the DADA Annotation Store
نویسندگان
چکیده
The DADA system is being developed to support collaborative access to and annotation of language resources over the web. DADA implements an abstract model of annotation suitable for storing many kinds of data from a wide range of language resources. This paper describes the process of ingesting data from a corpus of Australian Sign Language (Auslan) into the DADA system. We describe the format of the RDF data used by DADA and the issues raised in converting the ELAN annotations from the corpus.
منابع مشابه
Creating a Corpus of Auslan within an Australian National Corpus
The creation of signed language (SL) corpora presents special challenges to linguists. They are face-to-face visual-gestural languages that have no widely accepted written forms or standard specialist notation system, making even superficial transcription problematic. SL corpora need to be created taking these facts into account. Using the example of Auslan (Australian Sign Language) this paper...
متن کاملAn RDF Realisation of LAF in the DADA Annotation Server
The Linguistic Annotation Framework defines a generalised graph based model for annotation data intended as an interchange format for transfer of annotations between tools. The DADA system uses an RDF based representation of annotation data and provides a web based annotation store. The annotation model in DADA can be seen as an RDF realisation of the LAF model. This paper describes the relatio...
متن کاملCORILSE: a Spanish Sign Language Repository for Linguistic Analysis
CORILSE is a computerized corpus of Spanish Sign Language (Lengua de Signos Española, LSE). It consists of a set of recordings from different discourse genres by Galician signers living in the city of Vigo. In this paper we describe its annotation system, developed on the basis of pre-existing ones (mostly the model of Auslan corpus). This includes primary annotation of id-glosses for manual si...
متن کاملInteroperable Annotation in the Australian National Corpus
The Australian National Corpus (AusNC) provides a technical infrastructure for collecting and publishing language resources representing Australian language use. As part of the project we have ingested a wide range of resource types into the system, bringing together the different meta-data and annotations into a single interoperable database. This paper describes the initial collections in Aus...
متن کاملReal-time Facial Expressions in the Auslan Tuition System
Facial expressions are an integral part of Australian Sign Language (Auslan). This paper presents the implementation and integration of facial expression into the Auslan Tuition System. To incorporate this feature, we use a 3D mesh and animation of an avatar to dynamically display the Auslan signs. Vertex blending is used to improve the aesthetic quality of this model. The mesh provides a basis...
متن کامل